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Abstract 

Felix-Medina and Thompson (2004) proposed a valiant of link-tracing sampling to 
estimate the size of a hidden population such as drug users, sexual workers or homeless 
people. In their valiant a sampling frame of sites where the members of the population 
tend to gather is constructed. The frame is not assumed to cover the whole population, 
but only a portion of it. A simple random sample of sites is selected; the people in 
the sampled sites arc identified and arc asked to name other members of the population 
which arc added to the sample. Those authors proposed maximum likelihood estimators 
of the population size which derived from a multinomial model for the numbers of people 
found in the sampled sites and a model that considers that the probability that a person is 
named by any element in a particular sampled site (link-probability) does not depend on 
the named person, that is, that the probabilities arc homogeneous. Later, Felix-Medina 
et al. (2015) proposed unconditional and conditional maximum likelihood estimators of 
the population size which derived from a model that takes into account the heterogeneity 
of the link-probabilities. In this work we consider this sampling design and set condi¬ 
tions for a general model for the link-probabilities that guarantee the consistency and 
asymptotic normality of the estimators of the population size and of the estimators of the 
parameters of the model for the link-probabilities. In particular we showed that both the 
unconditional and conditional maximum likelihood estimators of the population size are 
consistent and have asymptotic normal distributions which arc different from each other. 

Key words: Asymptotic normality, capture-recapture, chain referral sampling, hard-to- 
detect population, maximum likelihood estimator, snowball sampling 
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1 Introduction 


Conventional sampling methods are not appropriate for sampling hidden or hard-to-reach 
human populations, such as drug users, sexual-workers and homeless people, because of the 
lack of suitable sampling frames. For this reason, several specific sampling methods for this 
type of population have been proposed. See Magnani et al. (2005) and Kalton (2009) for 
reviews of some of them. One of this methods is snowball sampling, also known as link¬ 
tracing sampling (LTS) or chain referral sampling. In LTS an initial sample of members 
of the population is selected and the sample size is increased by asking the people in the 
initial sample to name other members of the populations. The named people who are not in 
the initial sample are added to the sample and they are asked to name other members of the 
population. The sampling process might continue in this way until a stopping rule is satisfied. 
For reviews of several variants of LTS see Spreen (1992), Thompson and Frank (2000) and 
Johnston and Sabin (2010). 

Felix-Medina and Thompson (2004) proposed a variant of link-tracing sampling (LTS) 
to estimate the size of a hidden population. In their variant they supposed that a sampling 
frame of sites where the members of the target population tend to gather can be constructed. 
As a examples of sites are public parks, bars and blocks. It is worth nothing that they do 
not supposed that the frame covers the whole population, but only a portion of it. Then an 
initial sample of sites is selected by a simple random sampling without replacement design 
and the members of the population who belong to the sampled sites are identified. Finally 
the people in the initial sample are asked to named other members of the population and the 
named persons who are not in the initial sample are included in the sample. Those authors 
proposed models to describe the number of members of the population who belong to each 
site in the frame and to describe the probability that a person is linked to a sampled site, that 
is, that he or she was named by at least one person who belongs to that site. From those 
models they derived maximum likelihood estimators of the population size. In that work 
those authors considered that the probability that a person is linked to a site (link-probability) 
does not depend on the person, but does on the site, that is, they consider homogeneous 
link-probabilities. 

Felix-Medina and Monjardin (2006) considered this same variant of LTS and derived 
estimators of the population size using a Bayesian-assisted approach, that is, they derived the 
estimators using the Bayesian approach, but the inferences were made under a frequentist 
approach. Those authors considered an homogeneous two-stage normal model for the logits 
of the link-probabilities. 

Later Felix-Medina et al. (2015) extended the work by Felix-Medina and Thompson 
(2004) to the case in which the link-probabilities are heterogeneous, that is, that they depend 
on the named people. Those authors modeled the heterogeneity of the link-probabilities by 
means of a mixed logistic normal model proposed by Coull and Agresti (1999) in the context 
of capture-recapture studies. From this model they derived unconditional and conditional 
maximum likelihood estimators of the population size. 

In this work we consider the variant of the LTS proposed by Felix-Medina and Thompson 
(2004) and a general model for the link-probabilities from which we derive the forms of the 
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unconditional and conditional maximum likelihood estimators of the population size. We 
state conditions that guarantee the consistency and asymptotic normality of both types of 
estimators, and we proposed estimators of the variances of the estimators of the population 
size. It is worth noting that our work is based on that by Sanathanan (1972) in which she 
derived asymptotic properties of both unconditional and conditional maximum likelihood 
estimators of the size of a multinomial distribution from an incomplete observation of the 
cell totals which is a situation that occurs in capture-recapture studies. Thus, our work is 
basically an adaptation of that by Sanathanan (1972) to the estimators used in the sampling 
variant proposed by Felix-Medina and Thompson (2004). 

The structure of this document is the following. In section 2 we describe the variant 
of LTS proposed by Felix-Medina and Thompson (2004). In section 3 we present proba¬ 
bility models that describe the numbers of people that belong to the sites in the frame and 
the probabilities of links between the members of the population and the sites. From these 
models we construct the likelihood function that allows us to derive the unconditional and 
conditional maximum likelihood estimators of the parameters of the assumed model for the 
link-probabilities and of the population size. In addition, we present conditions that guaran¬ 
tee the consistency of the proposed estimators. In section 4, which is the central part of this 
paper, we define the asymptotic framework under which are derived the asymptotic properties 
of the proposed estimators. In section 5 we proposed a method for estimating the variance- 
covariance matrices of the estimators of the different vectors of parameters that appear in the 
assumed models. Finally, in section 6 we discuss some points to be considered whenever the 
results of this paper want to be used in actual situations. 


2 Link-tracing sampling design 

In this section we will describe the LTS variant proposed by Felix-Medina and Thompson 
(2004). Thus, let U be a finite population of r people. Let U\ be the portion of U that is 
covered by a sampling frame of N sites A A N , which are places where members of 
the population tend to gather. We will assume that each one of the T\ persons who arc in U\ 
belongs to only one site A t in the frame. Notice that this does not imply that a person cannot 
be found in distinct places, but that, as in ordinary cluster sampling, the researcher has a 
criterion that allows him or her to assign a person to only one site. Let M % be the number of 
people in U\ that belong to the site Ai, i = 1,..., N. The previous assumption implies that 
t i = Mj. Letr 2 = t — T\ be the number of people that belong to the portion U 2 — U—U\ 
of U that is not covered by the sampling frame. 

The sampling procedure is as follows. An initial simple random sample without replace¬ 
ment (SRSWOR) Sa of n sites A \,.... A n is selected from the frame and the members of 
the population who belong to each sampled site are identified. Let S 0 be the set of people 
in the initial sample. Notice that the size of ,5' 0 is M = M, . Then from each sampled 
site Ai, i = 1 ,..., n, the people who belong to that site are asked to name other members of 
the population. A person and a sampled site are said to be linked if any of the persons who 
belong to that site names that person. Let Si and S 2 be the sets of people in U\ — S 0 and in 


3 



f/ 2 , respectively, who are linked to at least one site in Sa- Finally, from each named person 
the following information is obtained: the portion of U where that person is located, that is, 
U\ — Sq, Ai G Sa or C/ 2 , and the subset of sampled sites that are linked to him or her. 


3 Unconditional and conditional maximum likelihood esti¬ 
mators 

3.1 Probability models 

As in Felix-Medina and Thompson (2004), we will suppose that the numbers Mi ,..., Mn 
of people who belong to the sites Ai ,..., A N are independent Poisson random variables with 
mean Ai. Therefore, the joint conditional distribution of (Mi ,..., M n , Ti — M) given that 
Y^i Mi = Ti is multinomial with probability mass function (pmf): 

/(m t ,.... m n , r t - m|r.) = (±) (l ~ ■ d> 

To model the links between the members of the population and the sampled sites we will 
define for person j in C4 — Sq the vector of link-indicator variables ) = (x [^,..., ), 

where X i3 = 1 if person j is linked to site A, and X {] = 0 otherwise. Notice that Xj 
indicates which sites in Sa are linked to person j. We will suppose that given Sa, and 
consequently the values M t s of the sampled sites, the X^s are Bernoulli random variables 
with means s and that the vectors X^' 1 are independent. Let C2 = {(aq,..., x n ) : Xi = 
0,1; i — 1,..., n}, that is, the set of all the n-dimensional vectors such that each one of their 
elements is 0 or 1. For x = (aq,..., x n ) e Cl we will denote by the probability that the 
vector of link-indicator variables associated with a randomly selected person from Uk — Sq 
equals x, that is, the probability that the person is linked only to the sites Ai such that the /-1h 
element Xi of x equals 1. We will suppose that tTx 'depends on a q k -dimensional parameter 
Ok = • • •, 9q k J) G ©/:. C W k , that is, n^' 1 = n^\d k ), k = 1,2. In this work we will 

assume that 6 k does not depend on the observed M,s. 

Similarly, for person j in Ai G Sa, we will define the vector of link-indicator variables 
xJ Ai) = (x\f l \ ..., X^j, Xt+pj ,..., X^), where X-^ = 1 if person j is linked to site 
Ai>, i! — 1,.. ..,n, i! i and X^- = 0 otherwise. We will suppose that given Sa the X-^s 

are Bernoulli random variables with means s and that the vectors Xy ' 1 ^ are independent. 
For each A { G S A , let Q—* = {(xi,... i,x i+ i,... , x n ) : x v = 0,1; i’ ^ i,i’ = 1,.. .,n}, 
that is, the set of all (n — 1)-dimensional vectors obtained from the vectors in Q by omitting 
their i-th coordinate. For x = (xi,i,x i+ i,..., x n ) G C2_j we will denote by 7 
the probability that the vector of link-indicator variables associated with a randomly selected 
person from A, equals x. We will suppose that /^ ' ’depends on the qi -dimensional parameter 
0i = ... ,0^) G © 1? that is, 7Tx 4i) = 7r ^ i] (0i), i = l,...,n. 
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For instance, Felix-Medina and Monjardin (2006) modeled the link-probability between 


person j in U k — A, : and site A* e Sa^y pff =Pr[X^ > = 1 \Sa) = exp (a\ K> )/ 


where the conditional distribution of a 


y 

(*) 


T*). 


y 


.(*) 


1 + exp I a) 


(fc) 


given Ak is normal with mean ip k and variance 
al, which we denote by rs_/ N (i> k ,al) and i> k ~ N (/j, k ,^l). Thus, in this case 

0 k = {Uk, 7fc, cr fc ) G ©fc = M x (0, oo) x (0, oo), and 


7r. 


(fc) 


(Ofc) = 



x 



exp(a) 

1 + exp(a) 
1 


1 + exp(a 


/fc(a|V0/fc(V0da# 


-/fc(a|^)/fc(^)(iad^ 


n—t 


where x = (aq,..., x n )e fl,t— x i> an d ./fc(a|ri) and f k (v') denote the probability den¬ 
sity functions of the distributions N (riy, af.) and N (ji k . ; 7 %), respectively. It is worth noting 
that those authors did not compute 7 Tx\O k ) because they followed a Bayesian approach and 
focused on computing the posterior distribution of the parameters. 

As another example, Felix-Medina et al. (2015) modeled the link-probability between 


person j in U k — A t and site A, e S A by the following Rasch model: pff = Pr ( X K X = 


y 


TO_ 


y 


= exp iai*> + /3f 


/ 


1 + exp (af } + $ k) 


(k) 

, where a - is a fixed (not random) effect 


associated with the site A* and /3 ■ is a normal random effect with mean zero and variance 
(j^ associated with person j in U k — A*. Therefore 


7T. 


(fc) 


(»*) = / n 


exp 

Xi (a\ k) + cr k z^j 

1 + 

exp ( af’ + a k z J 


4>(z)dz, 


where x = (aq,..., x n )G Q, 0 k = (a[ k \ ..., off cr fc ) e © fc = M n x (0, 00 ) and 0(-) denotes 
the probability density function of the standard normal distribution. Those authors compute 
7T<*> (Ok) by means of Gaussian quadrature formula. 

Notice that in the first example the parameter 0 k is defined previously to the selection 
of the initial sample because the of ^s are a random sample from a probability distribution 
indexed by 6 k and consequently this parameter does not represent characteristics of the par¬ 
ticular selected sample. On the other hand, in the second example the parameter 6 k is defined 
once the initial sample of sites is selected because the af^s represent characteristics of the 
particular sites in Sa- Therefore, as long as 0 k does not depend on the the results derived 
in this work are valid for both cases. 


3.2 Likelihood function 

To compute the likelihood function we will factorize it into different components. One com¬ 
ponent, L mult (ti ), is given by the probability of observing the particular sizes mi,, m n 
of the sites in Sa’, therefore, it is specified by the multinomial distribution (Q])- Two additional 
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factors are given by the probabilities of the configurations of the links between the people 
in Uk — So, k = 1,2, and the sites A t G Sa . To obtain those factors we will denote by 
Bx \ x = (x\, . .., x n ) G fi, the random variable that indicates the number of distinct people 
in Uk — So whose vectors of link-indicator variables are equal to x, and by R k the random 
variable that indicates the number of distinct people in U k — S 0 who are linked to at least one 
site Ai G Sa- Notice that R k = ^ xgn j 0 i R x \ where 0 denotes the n-dimensional vector 
of zeros. 

(k) 

Because of the assumptions we made about the vectors Xj of link-indicator variables 
we have that the conditional joint probability distribution of the variables { R x J } xe ^ given Sa 
is a multinomial distribution with parameter of size T\ — m and probabilities {"x'' (0 \)}xexi, 
whereas that of the variables { R x } x en is a multinomial distribution with parameter of size r 2 

/ q \ 

and probabilities {vrx (0 2 )}xeo- Therefore, the factors of the likelihood function associated 
with the probabilities of the configurations of links between the people in U k — So, k = 1, 2, 
and the sites Ai G Sa are 


Li{n, 6 1 ) 


(ti — m)\ 

(n - m - ri)! n xG o 


' x • xeO 


,(!) 


( 2 ) 


and 


-^ 2 (^ 2 ! O 2 ) — 


r 2 ! 


(t 2 — r 2 )! n 


xSf! 


r< 2 >! 


n 


,( 2 ) 




Notice that Tq } — T\ — m — r\ and r q 2) = r 2 — r 2 . 

The last factor of the likelihood function is given by the probability of the configuration of 
links between the people in S 0 and the sites A t G Sa- To obtain this factor, we will denote by 
_Rx Ai) , x = (xi,, Xi- 1 , Xi+i,..., x n ) G O-j, the random variable that indicates the number 
of distinct people in Ai G Sa such that their vectors of link-indicator variables equal x and 
by RA^ the random variable that indicates the number of distinct people in Ai G Sa who are 
linked to at least one site Aj G Sa, j ^ i- Notice that RAi) = ^ xgn _ ; _| 0 | R^\ where 0 

denotes the (n— 1)-dimensional vector of zeros and /? 0 4 ' = rrii—RA *). Then, as in the previ¬ 
ous cases, the conditional joint probability distribution of the variables {-Rx^jxeO-i given Sa 
is a multinomial distribution with parameter of size m, and probabilities 
Therefore, the probability of the configuration of links between the people in S 0 and the 
sites Ai G Sa is given by the product of the previous multinomial probabilities (one for 
each Ai G Sa), and consequently the factor of the likelihood function associated with that 
probability is 


£o(» 1 )= n 


rrij 


(Ai) 


n K' 4i) (ei)] ri ■ 4 a, \0 


r- — 


L± FT r Ai )| 

i= 1 1 IxG^ x * xGQ—. 

From the previous results we have that the maximum likelihood function is given by 

L(t\, t 2 , 0i, O 2 ) = T ( i)(ti, 0i)L (2 )(r 2 , 0 2 ), 
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where 


Oi) — Lmult{ti)Li(t\,Oi)Lq(Qi) and (3) 

L(2)(t2,0-2) — L 2 (t2, 9 2)- 


3.3 Unconditional and conditional maximum likelihood estimators of 

(n, 01) 

In this section we will derive unconditional and conditional maximum likelihood estimators 
of the parameters of the previously specified models. Henceforth we will suppose that con¬ 
ditional on the initial sample Sa of sites the following “regularity conditions” are satisfied: 


(1) 9* k is the true value of 9 k . 

( 2 ) 6* k is an interior point of 

(3) 7 T*\0* k ) > 0, x e and 7 ^^(0*) > 0, x e 0_j, i — 1,..., n. 

(4) dtTx\0 k ) / d0f ] , x 6 O and d'K ( ^ i) {di)/d0f ) , x e i = 1,.. . ,n; j = l,...,q k , 

exist at any 9 k € & k and 9\ e © 1 , and are continuous in neighborhoods of 9* k and 9\, 
respectively. 

(5) Given a <5i > 0, it is possible to find an £1 > 0 such that 


inf 


E 




^0 \yi) 


In 


xPffl / 


i-4'’w 


(JV - n) [l-^’W) 


IE 

i= 1 xg!)_i 


n£\0i)/ 

(Ai) (0*)ln 


7 r. 


1 -’To 1) ( 0 l) 

7^(0^) 


7T- 


(*) 


(01 


> £i- 


(6) Given a <5 2 > 0, it is possible to find an e 2 > 0 such that 


inf 

||^2 @2 ||7 X ^2 


E 

xGO—{0} 


1 -4 2 V 


In 


^ 2) (»2)/ 


VTx 2) (0 2 )/ 

1 - 74 2) (0 2 ) 


> ^2- 


Remark 1. For a differentiable function f : W —* M, the notation <9/(x 0 )/dxj represents 
d /(x)/ (9xj| x=Xo . 
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The regularity conditions (l)-(4) and (6) or conditions equivalent to them have been as¬ 
sumed by several authors such as Birch (1964), Rao (1973, Ch. 5), Bishop et al. (1975, 
Ch. 14), Sanathanan (1972) and Agresti (2002, Ch. 14), among others, in the context of 
deriving asymptotic properties of estimators of the parameters of models for the probabilities 
of a multinomial distribution. The particular form of condition (6) comes from Sanathanan 
(1972) who took it from the first edition of Rao (1973, Ch. 5) and it is known as a strong 
identifiability condition. Condition (5) is a modification of (6) to meet the requirements of 
our particular sampling design. In general, these conditions imply the existence and consis¬ 
tency of the UMLEs and CMLEs of 0\ and 0* 2 , and that they can be obtained deriving the 
likelihood function with respect to 0 X and 0 2 . 


3.3.1 Unconditional and conditional maximum likelihood estimators of T\ and 0] 

Let us firstly consider the unconditional maximum likelihood estimators (UMLEs) f[ U ' > and 
§\ [ } of t i and 0\. The log-likelihood function of T\ and 0\ is 


l{i)(n,0i) = 1ii[L (1) (ti,0i)] 

= 1ii(ti!) — ln[(r 1 — m — ri)!] + t x ln(l — n/N) 

+ E x£!! a 111 + E”.! E x£!i _, A- 1 in [4' 4,) (ei)l + C, 


where C does not depend on Ti and 0,, and recall that r 0 ' ; = r, — m — r, and Iq 1,1 = 
m i — r( Ai \ Then, the UMLE 0^ of 0* is the solution to the following equations: 

9l(l){j\,0\) 9 tTx ^ (0 1 ) ^ Tx ^ 0VTx ^(0i) „ . 

2^2^— ATT"--0, J = 1,-, gl- 


99 


( 1 ) 


-=E 




^(©i) 99 


(i) 


i =i xeO_i nx ‘ (^i) 90 j 


( 4 ) 

Since T\ is an integer we will use the “ratio method” to maximize Lm {t\ , 0i) . [See Feller 
(1968, Ch. 3).] Thus 


L(i) (a, 0j) _ n(l - n/N)j: { o\9 1 ) 

— 1, 0i) (ri-m-ri) 


Since this ratio is greater than or equal to 1 if r x < (m + ry)/ 1 — (1 — n/N)^’ (0 x ) 

it is smaller than or equal to 1 if T\ is greater than or equal to that quantity, it follows that t\ 
is given by 

' M + R X 


and 

(u) 


(U) _ 


TV = 


i - (i -n/N)^ (e'd 


( 5 ) 


where [a;J denotes the largest integer not greater than x. Notice that the right hand-side of © 
is not a closed form for since this expression depends on 0\ \ In fact, ff !) and 0\ ! are 
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obtained by simultaneously solving the set of equations © and ©, which is generally done 
by numerical methods. 

A (o 

Let us now consider the conditional maximum likelihood estimators (CMLEs) r, and 

G 1 of t | and 0*. It is worth noting that this type of estimators was proposed by Sanathanan 

(1972) in the context of estimating the parameter of size of a multinomial distribution from 

^ (C) 

an incomplete observation of the cell frequencies. The approach we will follow to derive 

<C ) 

and G x is an adaptation of Sanathanan’s (1972) approach to our case. Thus, from © we 
have that 


Mtl, °i) = f ({^IxenKrrii}, t 1} 0i) 

= f ({^Ixen-tojki, {mt}, t u G x ) / (nKm*}, t u G x ) 

TTx^l) 


r\\ 


rixeo-{o} r * xgn-{o} L 1 


n 


.a)/ 


X- 


(ri — m)\ 


(t\ — m — r 1 )!r 1 ! . 
= Lii(0i)Li 2 (ti, 0i) 


1 -TT^^! 




^o\ d i 


r\ —m—ri 


( 6 ) 


Notice that the first factor Ln(0i) is given by the joint pmf of the multinomial distribution 


with parameter of size r\ and probabilities \ ttx j (0,) / 


7T, 


( 1 ) 


(*i 


xgO—{0} 


and that this 


distribution does not depend on T \. Note also that the second factor L \ 2 {t\ , G\) is given by the 


pmf of the binomial distribution with parameter of size T\ — m and probability 1 

<C) 

Thus, the CMLE G v of G\ is the solution to the following system of equations: 


7T, 


( 1 ) 


(*i 


3 


36 


(i) 


In [LnidjLoiGj] = 


.(i) 


dn£\e 


xEQ—{0} 


7T- 


( 1 ) 


(01 


ddf ] 1 

07^(0 


+ 


T'\ 


Sn’ts 


n 

EE 

i =1 


M3 


no\Oi) 36 


(i) 


TT^ 


i} (0i) 36 


(i) 


= 0, j = (7) 


A (C) 

The CMLE of t\ is obtained by the ratio method. Thus, since 


Lmult {ti)Li 2 ( 7 - 1 , 6 1 ) 
Lmult{t\ — 1)L 12 (n — 1, 0 1 ) 


ri(l - n/N)7T^ ] (0i) 
(ri -m - ri) 


it follows that 


M) 


Ti = 


M + /?, 


1 - (1 -n/N)n£ ] (d[ C) 


Note that ® is a closed form for rj 6 ' 1 since G\ ’ is firstly obtained from ©. 


(C) . 


( 8 ) 


9 






















3.3.2 Unconditional and conditional maximum likelihood estimators of r 2 and 0 2 

By a similar analysis as that conducted in the previous subsection we have that the UMLEs 
t 2 U ' 1 and 0 2 of r 2 and d* 2 are the solution to the following equations: 


E 

xGO 


7"x 2) d-K x } (0 2 ) 

vri 2 \e 2 ) 39f 


0, j = 1, ■ ■ • , ?2 


and 



R-2 


1 — 7T, 


( 2 ) 



(9) 


where recall that r^' 1 = r 2 — r 2 . 

.( C ) 

With respect to the conditional estimators, we have that the CMLE 0 2 of 0* 2 is the 
solution to the following equations: 


E 

xEf2—{0} 


rx } di Tx 2) (0 2 ) 

ir { *\0 2 ) 39f 


r 2 dn { o\6 2 ) 

l-^\e 2 ) 39f 


o, j = l,...,q 2 . 


The CMLE t 2 ' ] of r 2 is given by ©, but replacing 0 2 > by G 2 ' *. Note that in this case © is 

A ((J\ 

a closed form for f 2 . 


3.3.3 Unconditional and conditional maximum likelihood estimators of r = t x + r 2 

The UMLE and CMLE of r = T\ + r 2 are given by f ^ = if 7 ' 1 + r2 7/) and f= f 1 ^ C '' ) + t 2 C J , 

respectively. 


4 Asymptotic properties of the unconditional and condi¬ 
tional maximum likelihood estimators 

The structure of this section is as follows. Firstly we will define the asymptotic framework 
under which we will derive the asymptotic properties of the estimators. Next we will state 
and proof a theorem that guarantees the asymptotic multivariate normal distribution of any 
estimator of {t\ . 6]) that satisfies the conditions expressed in the theorem. Since not any 
estimator of (ri, 9 \ ) satisfies the conditions of the theorem, in particular the CMLE does not, 
we will state and proof another theorem that guarantees the asymptotic multivariate normal 
distribution of any estimator of 6] that satisfies the conditions of that theorem. Then, we 
will prove that the UMLE of (ti, 9\) satisfies the conditions of the first theorem, whereas 
the CMLE of 6\ satisfies those of the second one. In addition, we will prove that in spite of 
that result, the CMLE f i f ) does have an asymptotic normal distribution although it is not the 
same as that of ff J] . After that we will consider the asymptotic properties of estimators of 
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(t 2 , 0 * 2 ). Since this problem is exactly the same as that considered by Sanathanan (1972), we 
will only state a theorem that guarantees the asymptotic multivariate normal distribution of 
any estimator of ( 72 , 0 2 ) that satisfies the conditions expressed in the theorem, but we will 
omit its proof, as well as the proofs that both the UMLE and the CMLE of ( 72 , 9 2 ) satisfy the 
conditions of that theorem. Finally, we will obtain the asymptotic properties of the estimators 
and t of r. 


4.1 Basic assumptions 

To derive the asymptotic properties of the UMLEs and CMLEs of r k and 0* k , k — 1,2, we 
will make the following assumptions: 

A. r k —> 00 , k — 1, 2. 

B. Tfc/r ->■ a k , 0 < a k < 1, k = 1, 2. 

C. IV and n are fixed positive integer numbers. 


For convenience of notation, we will put 77 ,, either as a subscript or a superscript of every 
term that depends on r k , k = 1, 2. In addition, convergence in distribution will be denoted by 
—f and convergence in probability by 

Notice that from © it follows that the conditional distribution of M} n> given T\ is bino¬ 
mial with parameter of size T\ and probability 1/N, that is M^\t\ ~ Bin(ri, 1 /N); con¬ 


sequently M- T1 ^/ti is stochastically bounded, that is, M/ T1> = O p (ti). This means that the 
size of U\ T1 ' > is increased by increasing the sizes of the clusters, even though their num¬ 
ber N is kept fixed. In the same manner, the number of people in the initial sample Sq T1 \ 


(n) 


given by M^ Tl) = 


(n) 


Ti 


Bin(ri, n/N), is increased because of the increasing of 


Mi , i = 1,..., n, even though n is kept fixed. On the other hand, since T\ — M^ ri ^|ri 


Bin(ri, 1 — n/N), i7^ Tl) |S'^ Tl) ~ Bin(ri — M^ T1 \ 1 — tTq 1 ^ and R < £ 2 ' ) \S ( /[ 1 ' > ~ Bin(r 2 ,1 ■ 


■7r, 


(2K 


it follows that .R^|ti ~ Bin ly, (1 — n/N) (1 — 7Tq 


therefore /i, r ' j = O p (t\ ) and RJ 2 = O p (t 2 ). Thus, the sizes of the sets S\ n> and S. 
are increased because Ti and r 2 are increased even though the probabilities {ttL' '} x eo and 
{7 Tx 2) }xgo are kept fixed. 

We will end this subsection presenting the conditional and unconditional distributions of 
the variables R^\ R/l' 1 ' and R// 2) which will be used later in this work. Thus, from the 

Bin(Ti — 
Tl 


5 ( 72 ) 


r w 


and i? 2 T2) | r 2 ~ Bin(r 2 ,1 — 7Tq 2) ) 
(n) 


y(T 2 ) 


multinomial distributions indicated in Subsection 3.1 it follows that R/ '| 5^ 


?(n)i q(ti) 


M (n) , 7 ri 1) ), i?x 4i ^ | M - T1 ^ Bin( M - T1 ^ , tt^) and R/[' 2) |S'Y i; ~Bin(r 2 , 7r 


( 2)7 


therefore /?!/' * 


-E>in 


Ti, (1 - n/N ) 7TX 1 ’ , i?x Ai) |Ti ~Bin^ri, 7 t^/n'J and R { R 2) \t 2 ~Bin(r 2 , 
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4.2 Asymptotic multivariate normal distribution of estimators of (ti,0*) 

Theorem 1. Let d\ = {9\, ..., be the true value of Q\. Let f\ T< 1 and §[ l) = (0^4..., 
9^) be estimators ofri and 6 \, such that 


a) 0S T1) 4 <n. 

(ii) rr 1/2 {r™ 



/ 1-(1 


n/N)n ( 0 l) 



4 o. 


(iii) t[ 


- 1/2 


Vl) V r l ) “i J 




4o,i 


1, •••,?!• 


In addition, let E ( 1 be the (q\ + 1) x [q\ + 1) matrix whose elements are 

[sr‘] u =[i - (1 - n/AO^’W)] / [(1 - n/N)^\e\)] , 

Pr’Lj+HsrVu = - [v^w)] [k/wvk 11 ]. i = i,• -• ,<?i, 

[^r 1 ] i+llJi+1 = [sr 1 ] , +1 , i+1 =(1 - ^)E[ 1 /^ 1) (e;)]f«4 1) (e;)/s«f > lK 1) (9D/5« 


3 ( 1 )' 


xgf! 


AT 


E" =1 E«n_ ivk'w)] k'^wi/k 1 ’ 


i,3 = 1, ■■■,?!, 


and which is assumed to be a non-singular matrix. Then 


T, 


-1/2 /k(n) 


n - r i h T i 


1/2 /kn) 


- 0* 


4/v, 


91+1 


(0, Ei 


where Ei is inverse o/E 1 1 0 = (0,..., 0) G M 91+1 . 
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Proof. Evaluating equation © at (jf'" 1 . O^) we get 


9 


/(tl) ( At i) 

Vi 


E 


f^ Tl) <9 t4 1} ( e [ Tl) ) foo - moo - 4 ri) ^ 1} (0i n) ) 


xeo-o ttx ) (^i ri) ) dOf'* 


+ 




( 1 ) 


R (AhTl) dni Al \§[ Tl) ] 


EE . smC 1 ) 


1=1 xSf2_i 


7rr u (0rO 90 


r£ i] 


dirPiO™) 


+ 


(i)/a( t i)n xqA) 
xGf2 7Tx ^ (0i ) dU j 

+E E R ^ l ’ Tl) d7T * Al) 




dxPitt') 


Ai ) 


(i) 


1=1 xGf!_; 


A a i)(q{ t l)\ 
VTx (0i ) 


dd 


(i) 


V- fl!." 1 aAA) . ^’-n < ) (e; n) ), ^ ^ 

2^ m„a(nk Q „(i) + m^(n), Q „(i) + 2_^ 2^ 


Jf A l > T l) 


A 


a^HA’) 


( 1 )/«( T1 ) 
xeo TTi '(0 1 


<90 


4 1, (e l r ) ) 96 : 


s«f 


Since 


E^’l^'Vssf^o and y d^\e^)ldef = 0, 


(nk 


( 10 ) 


( 11 ) 


xGH 


xG^_ 


from m we g et that 


-1/2 ^x n) - (ri - M dd )^\0\) dTi£\d[ Tl) ) 


K xEf2 


TTx (6>! ) 


<90} 


(i) 


r( A 1,Ti) 


y^ -^x — d7T^ l \o[ ^ ( -i/2 9 ( T i) /„(n) -s(n)\ 

2. 2. ^j^)) 9 0O) f Tl Sp™ l r i ^ J 


Z=1 xEf2_ 


= -rr 1/2 ff, (Tl) 


Tl) 




.^0 ( 0 ! ) 


90 


( 1 ) 


1/2 \ Ti~ M (T1 > y^ 7rO ) (9S Tl) ) - 7^ (0J) d'R^\§[ n) ) 

‘ ' ^ .Vo A<A) 80f 


Mf 1 ’ V- 7r^)(0S ri) ) - 7T^)(0t) 97T^ ) (0S ri) ) 


r(n) 

E^E 


;=i 


Tl 


xGO_ 




90 


(i) 


( 12 ) 
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Let y x (Tl) = Rp l] - (n - r x (Ai ’ Tl) = Ri Al ' Tl) - m\ t1 \P 1 \0\) and 

? (A '’ ri) 9 tt£ 4,) (0J 


7 (n) _ —1/2 \ ^ -Rx n) \ - R 

i +1 1 Z^_(i)/«*n a /,(i) 


xgO 


90 ^ 


= T-, 


_ ^_—1/2 ^ 07Tx^(fl*) ^ 

^7ri 1} (6>t) dOf> ^ ^ 


xGfi 


i=l xGSl_ 


7^(01) 86 ' 


( = 1 XgO_; V~ 1/ -J 

(^i.n) r,Ui) 


3 


(n) 


where the last equality is obtained using (ITTb but replacing 0 { by 6\. Then, the difference 
between the left-hand side of (fl2l) and Z^{ is given by 


-- 1/2 


E 


r^> d^\e[ T ") 


xett^x (#i ) 


86) 


(i) 


EE 


Yx (Ai ’ Tl) 97^ (0^ 


1=1 xeSJ_, vri A;) (0[ Tl) ) d6f ] 


-Ti 


- 1/2 


8 


86) 


(i) 


/(n) /-(ri) n l ' Tl ')\ _ y[r \) 

f (l) (Jl ’ W 1 J z i+i 


= Ti 


- 1/2 


E^ 

, xeo 


n) 


s^/ef 0 ) 


a/ 1 ’TO 




de 


(1) 


4 1] (0t) 86 


(i) 




+ E E Y ± AhT1) 

1=1 xgf!_; 

-!/2 8 ( T1 ) /.(n) n (ri A 

1 00(1) *(1) l T l J 


97T^ ) (0[ T1) ) 


9^(0*) 


90 


(i) 


4^(0*) 90 


(13) 


Since unconditionally E(Yp ri ' > ) = 0 and C(Y x <ri) ) = Ti(l — n/N)nP > (0*)[1 — ^(0*)], 
and also E(Y^ Al ’ Tl) ) = 0 and V(Y± Al ’ Tl) ) = n(l/iV)7ri Ai) (0t)[l - ^ Ai) (0*)], it follows 
that r 1 ~ 1//2 l'x l) = O p ( 1) and t\ U2 y) a ^ t ' ! = () p (l). Consequently, these results along with 
conditions (3)-(4) and conditions (i) and (iii) of the theorem imply that (fl3l) converges to zero 
in probability. 

On the other hand, by the mean value theorem for functions of several variables we have 

that 

a/lCE-a/TO = EE (e , -»; i )a4 1,( e ( 1 ; i))/ a», w and (14) 

4- 4,) (<C ) )-4' 4,) (e;) = E“i («!?’-%) ^l 4|)(9 S)/ a »‘ 1) . 


where Op 1 and 0^ x are between O p J and 0\. Since the difference between the right-hand 
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side of (fl2l) and zj/E also converges to zero in probability, we have that 


- n) 

1/2 (n- 
1 

+ EZ— E 


aZ\0? ] ) 


-7T ( 1 )rfl (n), i 
.^0 v“i J 


(i) 


V_-_ 

(i )//;( T i) 
xeo Trie (0 1 


86 


o ( 1 )/^( Tl )\ <n 


86 


(i) 


2=1 


lz 


(!) I(}6 ri )\ 
lx > 


8^\e 


86 


(i) 


1=1 


sr 1 


T\ z -' -7 

xEf2_ 1 TTx 


^ (e ‘ } EHi 0 -" 


3+1,1 


- 1 / 2 , 


90 


91 


( 1 ) 


lz 




2=1 


90. 


(i) 


- Z) 


(n) 


j +1 


if 1 ’ - -n)] + E [^c 1 

2=1 


3 + 1 ,i+i L 




1/2 (Mn) 


o\ T 11 /)* 
^li “li 


- z%} 4 0 . 

(15) 


where 


S[ 

sr 1 


-1 


3+1,1 


3+M+l 


a^'teZ 


Tl 


(l)/A( T1 )i 

K (&i ) 

mm 


86 ) 


(i) 


and 


+i 


E 


1 a^\e[ T,} ) d^\e^) 


<*-(!) 

xGO VTx (01 , 


n /\,f( T l) 

E^E 


? 0 j 1} 90f } 


S' Tl xX^ !) ( 0 l Tl) ) 


90 


(i) 


90 


(i) 


(16) 


Expression ([5]) suggests the following equality in terms of f[ Tl) — t x and ttq 1 ] ) — 


- 1/2 /-(ri) 


i (Tl) [1 - (1 - n/N ) 4 1 ) ( 0 j Tl) )] - (m™ + } = rr 1/2 (ff 0 - n) 

x 1 —(1 —rz/iV)7r4 1) ((9i" l) )] -r^ 72 { (m^+R^ -r, 


-n l72 (l-n/iV) \*$\§™)-«S>W) 


By condition (ii) of the theorem it follows that the left hand-side of the previous equation 
converges to zero in probability. Therefore, if we divide the right hand-side of this equation 
by (1 — n/N)7TQ (0*) and use (fT4l) . we will get that the following expression also converges 
to zero in probability, that is 
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.-1/2 /~(n) 


r l - T l 


1 - (1 - n/A04 1) (e l i n) ) 




-Ti 


- 1/2 


M( T1 ) + i?! ri) ) - n 1 - (1 - n/iV)7r^ j (6>i) 


(i)/ 


91 


(«<?>-» 


(l-r,/JV)4‘>(9t) 

1 +r< 1) (<' ) ! 


lz 


= =r 


2=1 

-1 


4\ei) 


89 


(i) 


1,1 L 


Ti 


-1/2 /~(n) 


r i -n 


E 91 |V 

^i=i L 


J i,i+i L 


Ti 


1/2 /fl(n) 


1 




-Z| Tl) 4 0,(17) 


i(n) • , ._ a ( t 8 


where ' is between G 1 and G\ and 

i-i 


sr 


1 - (l-n/N)7v { 0 l) (G[ ri) ) 


J 1,1 


(l-n/N)^\8l) 


sr 


-l 




-I l,i+l 




89 


(i) 


(18) 


and 


y( T i) 1/2 

Z 1 — r l 


M (n) + ^(n) j _ Ti l _ (l _ n /N) 4 lj (0*) 


(l) ( 


Let W[ ri) = [t( 


-— 1/2 /'(n) 


Ti 


1 


(l-n/iV)^^) 

r 1 ),r 1 1 / 2 ( 0 i Tl) - 01 )]' and Z™ = [Z™, Z< n) , 


,4 


(n) ]/ 
91 + 1-1 ’ 


by the previous results we have that 


E('wj Tl) - Z (Tl) 4 0, (19) 

where 54 1 is the (gi + 1) x (gi + 1) matrix whose elements are defined in (fT6l) and (fl8l) . Notice 
that from the definitions of the matrices E^ 1 and E^ 1 , conditions (3)-(4) and condition (i) of 
the theorem along with the fact that (r± — } 4 1 — n/N and Mj Tl > /t\ 4 1/iV, it 

follows that Ej) 1 4 S^ 1 . 

We will show that Z (ri ^ 4 Z ~ jV gi+ i(0, S)" 1 ) as T\ -+ oo. To do this, we will associate 
with each element t G U\, t — 1,..., T\, a random vector Vj4 = \ , V^ +1 ]' such 

that 


(a) Vj;^ = 1 and V/8 +1 = [7ri 1J (0J)] 1 8-k^\G* 1 ) /89^ } , j = l,...,q 1 ,ifteU 1 -S 0 and its 


r(l) 


r (!)/ 


rW 


( 1 ) 


associated vector 9Q 1 of link-indicator variables equals the vector x e Q — { 0 }; 


(b) Vg> = - 


1 - (1 - n/N)w^(Bl)} / [(1 - n/N)^\e\) 


and V, 


(!) _ U(l) 


t,j +1 


= VTn 


\0l)] 


-1 


x8'Rq\g* 1 )/89^, j = 1 ,.. .,qi, if t <E Ui — S 0 and its associated vector x[ V) of 
link-indicator variables equals the vector 0 G Si, and 
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(c) V t ? = 1 and V t ? +1 = [tt^ 4 z) (0*)] l di t^ 1 \0\) /d6f\ j = 1,..., q u if t e A t G S A and 

its associated vector X? of link-indicator variables equals the vector x e Q_/. 

Since 


n 1 / 2 E^« 


t =1 


(1) _ _-l/2 

tl — ' 1 


_ 7 On) 


n 


>+ - P - mm - Rf) i-ti-V^W) 

7 V 7 (l-n/JV)ir<’(9;) 


and 


E /2 Ekl 


Tl 


( 1 ) 


tj+l “ r l 


- 1/2 


t=l 


E 


f?x n) ^ R^ l,Tl) dn^ l \ei) 

n£\et) 89? 


+E E 


««_, *r'(e;) 90, 
i = i, 


= Z K - 


(ri) 


3 +1 


Pr <! v£ } = 


it follows that Z (n) = r, V?. 

From the definition of V t ? we have that 

Pr jyW = l| = (1 - n/N ) 1 - + n/N , 

i - (1 - n/AO^’P;)] / [(1 - } = (1 - n/AO^W), 

Pr {kill = [4 1 ) ( 8 i)]" lair i 1 ) (®i)/ 8 *j 1> } = (l-ii/A r )4 1 ) ( e i). x £ n, j = 1 ,..., <ji, and 

Pr{v£li = [rr<- 4 '>(e;)]- 1 54 - 4 - ) (et)/se< 1, } = (i/iv) 4 - 4 ->( 9 ;),x 6 n.,.j = 1 .<n, 

Z = l,...,ra; 

therefore, the expected values of the variables V t ? are 

E (k, 11 ) = (1 - n/N) [l - TT^te;)] + n/N - [l - (1 - n/N)4\8\) 

and 


= 0 


B(pjli) = E 9 ^ 1) WV a E ) ( 1 -"/ JV ) + E E 8n^\e\)/def(\/N) = o, 


xgfl 


1=1 xgf2_ 


J = !, 


because of (fill) . Thus, E ( V) 1 ^ ) = 0, t = 1,..., Ti. Furthermore, their variances are 


v (V,?) = (1 - n/N) [l - jr?>( 0 ;)J + n/N + 

1 - (1 - n/N)irlW) 

(1 -n/N)*S\6‘) 


1 - (1 -n/N)^\e\) 


1 2 


(l-n/N)4\e\) 
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and 


V f V t 


'''"'>-"-* OT S4VS 


d^\e\) 


90 


(i) 


1 

'n 


EE 








(l) 


,3 = 


and their covariances are 

r(l) T/ (l) 




xe£j-{0} 'V 




x 


(i- n /jv) + 53 ;r 

Z=1 xS 

i s^'V; 


i-(i-n/'v)4 1 ) (g;)^o 1 ) (g; ) 

(i - n/j vJttJ 11 ^;) a/ 1 ’ 

Sir^’te;) i 

aof iv 


7T, 




90 


(i) 


Co<>(v!)l „v;« + i)=(i-n/]V)E 


, j = 1 ,... ,gi, and 

_i_ d^\d\)d^\ez 

rO/a*! on! 1 ) on! 1 ) 






xen^W) dO? 

1 97ri j4i) (0t)97r^ 4i) (0*) 


:=l xeo_i ^ ^(^i) 36 


3 ( 1 ) 


90 


(i) 


.7, / = 1, ...,gi, 3 ± 3 ■ 

Therefore, the variance-covariance matrix of v[ n is E, 1 . 

Finally, since the V, , t — 1,..., r 1; are independent and identically distributed random 
vectors, by the central limit theorem it follows that 


Z (m = rr 1/2 XE v ‘ (1) ^ z ~ jv«+i(o, sr 1 : 


Consequently by (fl9l) . 


W} ri) = 




-1/2 / ~(ti) 


n), r i 


1/2 


0--_6in 4 e 1 Z~JV, 1 + 1 (0,E 1 ) 


as Ei —)■ S 3 


□ 


4.3 Asymptotic multivariate normal distribution of estimators of 6 \ 

Theorem 2. Let G\ = (0£,..., 0*J t/ie trwe va/ne o/9i. Let §[ lJ = (0^,..., g\ be 
an estimator of G\, such that 

0) e { [ l] 4 e\. 
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(ii) r[ 


- 1/2 I a 


ae 


ITT 


In 


l[ ? ) (e < r l )4” ) (« ( r ) ) 


->0, j = 


In addition, let 1 be the q { x r/i matrix whose elements are 

[»r‘]y = [® , r 1 ] J>i = (i - n/N)[ i - 4'V;)] 

x 5] [lM'Ze;)] fe'HetJ/s#! 1 ’! feVsD/a/ 1 ’ 


xGfi-{ 0 } 

l 


+lvE”.iE xe „_, fMEs;)] [a4' 4 ‘ , («;)/a», <1) ] [a4 4 ‘ ) (e;)/a«‘ 1> 

hj = 1 , 

where {6*f = 7 Tx^( 0 *)/[l — 710^(0*)], x e f 2 — { 0 }, rmJ suppose that fi/f 1 is a non¬ 


singular matrix. Then 


. 1/2 


ef’-eil 4 jv„(o,*,), 


where \l/i is the inverse of \E r 1 1 and 0 = (0,..., 0) G K 91 . 

X (Vi ) 

Furthermore, if is an estimator of T\ such that 
(iii) r” 1/2 |f 1 (ri) - / 1 - (1 - n/N )vrj ) 1) (0^° 

then 


4o, 


r i 1/2 (4 Tl) - n) 4 iv(o, of), 


where 


cr i = 


1 — n/iV 


1 - (1 - n/N)^\0\) 


4 1> («;) + 


(1 - n/JV) v^, 1 ' (»;) 




Vir?’ W) 


1 - (1 - n/N)^\e\) 


( 20 ) 

and Vtt^ (0*) = dir^ (0*) /d6\ V \ ..., (0*) /d6$ is the gradient of ti^' 1 {Of eval¬ 

uated at 0*. 

Proof. From the definitions of L^\d\) and L l J l > (0i ) we have that 


d 


d9 


(i) 


In 


Lq\e , ;‘ , )L^\e[ r,) : 


E 


RF 8n iV#; 


(lEfl (T1 ^ 


xen-o 7rx' ) (^i j) ) 


EE 


jj>(4;,Ti) 


( 1 ) 

? 

07^(0^ 


Since 


^ d ; k ( £\d^)/d6 < 'p — 0 and ^ d'K^ l \d'^ 1> )/d6 i ^ > — 0, 


‘—- X—. (Ai)/fl( r i) X 
i=i xeo_i Trie '(0 X , 




90 


(i) 


o(l) 


( 21 ) 


( 22 ) 


xgf!—{0} 


xgf!_i 
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from (HIT) we get that 


-- 1/2 



Rp l] - R\ n >n^(Ot) 0^>(0\ 


o~(i)/n (ri K 




06 


(1) 



R^ l ’ Tl) - m\ Ti) 7ri Al \ei) OnL Al \o[ Tl) ) 




as 


in 


E 




#t 1 ) (e;)s 4 1 , (e^ ) ' 


xen-{o} 

(n) _ -^(Z)//S^ 1 ) 




(i) 


06 

/ J 


Ti 


1=1 J ‘ xgn_j 
d r i) _ r?( T i) e>( t i)~(1)//i*\ \ZZ>ti) 




06 


(i) 


(23) 


Let E x w = RX 1 ’ - Y^ UT1> = R (Al ’ Tl) - and 


Kn) 


= T- 


- 1/2 


v- R x n) ^x 1 } ( 6 »D V- y- 

2^ ~(i)/«*n Q/1 (i) + 2^ 2^ 


xgf2 


_ ^-V 2 


= Ti 


-{„}** (*D 

v >x (Tl) d4 1} (0D 22 v 

2 ^ -(!)/«*>> Q/ 1 (i) + 2 ^ 2 ^ 


xEf 2 


rlo } **(«!) 


{Tl) 7r { x Al \Gl] 

R { P l ’ Tl) dvr^^D 

,=1 *=n_, Z/e;) Be? 

y x (A * ,ri) cb 2 Ai) ( 0 D 
vr x Ai) ( 0 ^) oe { - ] 


1=1 

a(ti) 


where the last equality is obtained using (l22l) but replacing 6 *, by 0). Then, the difference 
between the left-hand side of (1231) and Z ( p 1 is given by 


_—i/2 J 3x ( 0 dnx Ho 

xgO-{0} TTx (“i 


1 J + v V _ 

J=1 xeSJ.i vri \6 1 


Yr 1 ’ 11 ’ dn^pep 1 ^ 


06 


(i) 


06 


(i) 


— 1/2 0 
' T ' 111 


06 


4? ) (ei n) )4 T °(eJ n> )' 


z ; 


(ti) 


J 


- n.~y* 


= r- 


E 

xgf2—{0} 


+ E E y Z 

1=1 xEf2_/ 

- 1/2 ^ 

- T i 


.Tl) 


1 

a^/ef 1 ’) 

i a^Vi)’ 


U* (#1 

oef 

4 1} (6»t) 90f 


1 

OnP l) (6p ] ) 

1 97ri Ai) (6>*)’ 


06? 

vri A,) (6>t) 90] 1} 





L<? , (9 < r ) )4 r ‘ ) (9f i) ! 


(24) 


J 
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Since = O p ( 1 ) and t[ 1/2 Y^ Ai ’ ti) = O p (l), these results along with conditions (3)- 

(4) and conditions (i) and (ii) of the theorem imply that ([24b converges to zero in probability. 
On the other hand, by the mean value theorem of several variables we have that 


= EL 4 - and (25) 

= EL - %) 


where 0^ and Oj^l are between d\ ' J and 0*. Since the difference between the right-hand 
side of (l23l) and Z- also converges to zero in probability, we have that 


I 73V 

.1/2 I -ttl 


(n) 


E 


d^\G[ T1 ^ — 


Tl x e ^{o}#(0l Tl) ) 


i =1 


E 1 - 


( x ) z/a( T1 ) > 

lx 7 


dn£\0 


w n . 7\ / /‘(' ri ) ._ 

e^Ve 


i=i 








at) 


(l) 


2=1 


1 i 


aef 

dni Al) (0 


Afx.) 


89 


(i) 


<n 


E *r‘ 


2=1 




A' 2 (e!" ) - 


z<"> 4 o. 


where 


*i 


-l 


R\ 


Oi) 




E 


S7r< I) (eS I1) )s4 1) (e!; ) ) 


Ti ^ 

xGO-{0} VT x ft?! , 


E^E 

i=i 


f 89 ; i} 


Tl xSl,^^) < } 99^ 


Z (n) 


(26) 


(27) 


Notice that from the definitions of the matrices and 'h, 1 , conditions (3)-(4) and 
condition (i) of the theorem along with the fact that /r\ A (1 — n/N ) [1 — ( 0 *)] and 

M/ Tl) /ri 4 1/N, it follows that (bf 1 4 4 r ^ 1 . 

By condition (iii) of the theorem and using exactly the same procedure as that used to 
obtain expression ([171) we will get that expression which we will put in the following terms: 


ai 


Ti 


-1/2 (At i) 


Tl 


E Qi ^ 

i= i ai+1 


r i /2 («!?’ - 


z (n) 4 o, 


(28) 


where 


ai = 


l -(l-n/'VK 1) (e l i r,> ) 

(l-n/NfrPW) 




Z (ri) = rj 


- 1/2 


"? ^ 2+1 


M (n) + ^(n) \ _ Ti i _ (i _ n /N)n ( o\0l) 


8^(0 


10 


36] 


( 1 ) 


i = 1 , 


>9i, 


(i), 




(29) 
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and 6[ T q ] is between 6\ 11 and d\. Notice that conditions (3)-(4) and condition (i) of the 


theorem imply that a* —* a*, i = 1 ,..., qi + 1 , where a\ = 

(1 -n/N)^\G\), and a m = - [cbr^ {O^/defA /i^\0\),i = 


(1 -n/N)^\Gl) / 


Let Z( Tl > = 


4 


(n) v (r i) 


7( t l) 
J <?1 


, then by the previous results we have that 


*4 


r l/2 /^(n) _ ^ 


- z ( ti) 4 o', 


(30) 


where 'E , 4 is the q 1 x q\ matrix whose elements are defined in i 

We will show that Z (r ^ 4 Z ~ AT gi (0', 'J'] -1 ) as Ti —> oo, where Z = (Z l5 ..., Z gi ) r , and 
that Z^ 4 Z ~ iV(0, ai), where 4 ri ) is given by (|29|) . To do this, we will associate with 
each element t E U 1 ,t= 1,..., r 1; a random vector Vj 1 ' = [V t ^\ ..., V^j]' and a random 
variable 1/4 such that 


(a) vj^ = [44 (0*)] l d^\0\)/def ] , j = 1,..., q lt and V t w = 1, if t E Ui - S 0 and its 


rO), 


□(!) „• _ 


(1) _ 


associated vector Xp of link-indicator variables equals the vector x 6 Q — { 0 }; 

1 - (1 - n/N)4\ei)} / [(1 - n/N)4\e\) 


(b) V t f = 0 ,j = l,...,g 1 ,andVf ) = 


\ftElI] —Sq and its associated vector X ) 1 j of link-indicator variables equals the vector 
0 E 0, and 

(c) vff = [t T¥ l \e\)}- 1 dTr¥ l) {e\)/dOf\j = and V^ (1) = 1, if t E A t E S A and 

its associated vector Xp of link-indicator variables equals the vector x E Q_/. 


Since 


t 1/2 Ek 


n 


t,j ~ T 1 


t= 1 


E 


i?x Tl) <944 (6>D A x- f?^’ Tl) <94444) 






= zj Tl) , J = l,..., gi , 

it follows that Z (n ) = r ^ 1 ^ 2 Vp\ and 


(i) 


EE 

i=l xGO_; 


7rr'(e;) ao 1 


E /2 E v, 


t=l 


( 1 )_ - 1/2 
— 11 


= z (ri) . 


M<™> + fi/‘> - U - M™ - 4 T,) ) 1 - (! - 

V 7 (1 _„/jv)4')(e;) 
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From the definition of and V]'' 1 we have that 


Ki) 


Pr { V S = [4 1, (9;)]' 1 »i 1, (e;)/» J (1> } = (1 - n/N)^\6 1;), x e SI - {0}, 

3 = !,•••, 9i, 

Pr {v t ( ]> = o} = (1 - n/A'i^Ve;), j = 1, ... , 9l , 

Pr{ p y> = [4 J,) (e;)] _1 S4' 4 ‘ ) («;)/S9f } = (l/iV)4' 4 '>(9D.x e fi-,, j = 

l = !,■■■, n, 


and 


Pr i/ t (1) = H = (1 - n/N ) 1 - 7 r^ 1J (0*) + n/iV and 


,(i). 


Pr 1 K (1) = - 


i_(i_ n /jv )lr (i) W ) / (i_„/jvK'(e;) } = (i-#K ,, («;) ; 


(l)/ 


r (l), 


therefore, the expected values of the variables V/j and V, are 

£ (Ef) = E torPm/dtf-H 1 - ™/W) [l - 4‘Vl) 

xGO—{ 0 } 


E ]T s4- 4 ‘) ( e;)/s»‘ 1> (i/jv) = o, j = i,..., 9l , 

i=l xgQ_i 


and 


£(V/ 1)N ) = (1 — n/N) [l — tt^ 1 ) (0*)] + n/N — [l - (1 - n/N)^ \d\) 


= 0 


because of (l 22 l) . Thus, E J = 0 and E [Vt^ J = 0 , t = 1 ,..., T\. Furthermore, their 
variances are 

r m i2 

-1 C>~ (-L)//1*\ 
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EE 
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7r^ } (0t) 
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, J = l, •••,?!, 


and 


V (y,* 1 ’) = (1 - n/N) [l - + n/N + 

i _ (i _ „/n)7,S\bi) 

(1 - n/N)^\e\) ’ 


1 - (1 - n/N)*W(0l) 


(1 -n/N)^\e\) 
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and their covariances are 


c™ (ri‘>, ri‘»)=(i - j) [1 - *r«(e;)l 5] 


1 dn£\dl)dn 4(6>t) 


(!)/ 
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3(1) 


vEE 




(i) 


9» 


jW 




( 1 ) 




and 




.(l)/ 


g^(gt) 

aP 1 ) 

xgn-{0} 


+ vEE 




1=1 xEf2_j 


90 


(i) 


- 0 . 


Therefore, the variance-covariance matrix of vj 1 * is V E', 1 • 

Finally, since the (V f (1 ' )/ , v} 1} )', t — 1,..., t±, are independent and identically distributed 
random vectors, by the central limit theorem it follows that 

Tp 1 O' 

0 ai 


(Z (ny Z fr)y = Ti -i /2 ^ (v p, v^y 4 (Z', Z)' ~ 9 V, 1+1 l u 9l+1 , 


AC 


0 ' 


Thus, Z ( T1 ) 4 Z ~ i\T 91 (0', T', *) and .4 ri ) 4 Z ~ N(0, ai). Consequently by 


D 


T, 


1/2 


4(^ 1 zy~iv, 1 (o,^ 1 ) 


as \I>i ->■ 

At last, from (l28l) and the previous results 
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(l-n/AQggo;) 
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where f^iZlj is the z-th element of 'T i Z and 


cr 2 = 


1 — n/N 


1 - (1 - n/N)7rS\el) 


PW) + 


(1 -n/N) vg 1 ’ (»;) 


*i 


vg 11 («;) 


i - (i - n/vjgVl) 


□ 


4.4 Consistency of the UMLE and CMLE of (it, Q{) 

To prove the consistency of the UMLE and CMLE we will use condition (5) and the following 
inequality of information theory: If ]P a* and )P bi are convergent series of positive numbers 
such that J2 a i > X! 4 then Yj a i log (4) < ^ a, logfa,), and the equality is attained if and 
only if ^ = bi. See Rao (1973, p. 58). 
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4.4.1 Consistency of the UMLE 

t(E0 


Let us first consider 6\ '. Using © and © and the definition of the UMLE ( t[ U> ,§[ 1 ) we 
get that 


h) (u l u \d ?’) = R< * ] ln {^ 1} } ) / 1 ~ 7l o ) (®i 

xGO—{0} 


(u) 


E E R ^" n> ‘>1 rU (»; 

i= 1 x£f!_i 


(U) 


+ In 


J MULT I T\ 


,(U) 


+ In 




+ c 


> E in{4" («;>/[j-4”(«;>]}+E E R *‘ iT,> ln U 4 " (®; 

xgf2—{0} i= 1 xgQ_i 

+ l n [ Lmult ( r i)] + In [L 12 (ti, 0*)] + C = Z(i) (ri, 0*), 


where C depends oniy on observabie variabies. Since ln 




(u) 


-'MULT l T i 


,(U) 


and ln 


t (AU) 
L V2 l U , 


are nonpositive we have that 
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using n + 1 times the previously indicated information theory inequality we have that 
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Thus, by (fTil) and (l32l) we get that 
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(33) 


From the unconditional distributions of m[ T i \ and i?[ Tl \ /?4' T| ^ and J?x 4l,Tl) indicated 
in Subsection 4.1, it follows that R^/R^ 4 7p4(0i)/[ 1 — 7r o 1) (^i)]’ R ^’^/4 
7Tx Ai) (0p and M- T1 ' ) /R^ 4- l/{(iV — n)[l — 7To^(0*)]}. Therefore, the first two summands 
of the last term of the double inequality (1331) converges to zero in probability, In addition, 
since /t\ 4- 1 — vTq 1 ^ (0*), and from well known results of large deviations theory (see 
Varadhan, 2008), we have that for the binomial probability L 12 (t 1? 0*): 
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converges to zero 


The previous results imply that the last term of the double inequality 
in probability, and consequently so does the middle term. 
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Thus, 
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are bounded as ri —* cx) (otherwise the middle term of the inequality (l33l) would 
not converge to zero). Finally, condition (5) implies that for any tg > 0 we have that 


Pr 


e™ - ei 


< <£ \ —> 1, that is, of ^ 6\. 
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Straightforward results of the previous one are the following: "x ! [ 0\ J -£ tTx ’ (O f), 

x G f2, and (of A -3 7r£ A ^(0*), x G f2_ i5 i = 1,...,n, as 7rx^(0i) and 7Tx 4i) (0i) are 


assumed to be continuous functions of #i. 

With respect to t [ U> , from expression © we have that the difference between t[ U) and 
+ R^ 1 ^ / 1 — (1 — n/N)^' 1 ^0^ ^ is less than 1. Thus, j t± U ' > — / 

1 - (1 - n/N )vrf, 1) (flf)/ n = ff^/Vi - f^ (ri) + R^A /n / 1 - (1 - n/JV) 
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(i) (ef) 


bility so does t\ /t\. 


—> 0, and since the second term of the last difference converges to 1 in proba¬ 
te 


4.4.2 Consistency of the CMLE 

-(C) 

By the definition of the CMLE 0 1 , we have that 
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where C depends only on observable variables. 


*( u) 


Using the same procedure as that used in the case of the UMLE 0 1 we will get the double 
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>(ti) 


inequality (1331) but in terms of 9 1 instead of 6 1 and without the terms \i\[L M ult{t\)]/R\ 
and In [L V2 (ti, 0)))] /ft| T1 ^. Consequently, we will also have that 0*) C) —> 0*, 7Tx 1) (df ' j A 

7Tx^(0t)> x e O, 7 Tx' 4i) ( 0 ^) -4- 7Tx 4i ' ) (0i), x G 0_j, i = 1,... ,n, and fi^/'Ti —> 1, where 


the last result is obtained by using expression ([8]) and the same arguments as those used to 
prove that t^/ti A 1. 


4.5 Asymptotic distributions of the UMLEs and CMLEs of t\ and G\ 

4.5.1 Asymptotic multivariate normal distribution of the UMLE of ( t \ , 6 \ ) 

-1/2MI) 1/2 z(U) 


We will prove the asymptotic multivariate normal distribution of ( r. 


'i > 'i 


0i by 


proving that this estimator satisfies the conditions of Theorem 1. Condition (i) was already 
proved in the previous section. From expression © it follows that {r\ U) . §\ j satisfies con¬ 
dition (ii). Finally, by the definition of the UMLEs we have that condition (iii) is also satisfied. 


Thus, by Theorem 1, 
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implies that r, ^ 2 ffj 1 ' ] —T\ ) -4- iV(0, crfjj) and (0} (;j — 9* ) -4- i\T gi (0, Ei 22 ), where 
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E, 1 obtained by removing its first row and first column. 


4.5.2 Asymptotic multivariate normal distribution of the CMLE 0 \ and asymptotic 
normal distribution of the CMLE t [ C * 

The CMLE ( t j 1/2 t 1 (C) , -t, 1 / 2 0' j does not have an asymptotic multivariate normal distribu¬ 
tion since this estimator does not satisfy condition (iii) of Theorem 1. To see this, notice that 
by ([7]) it follows that 8 {In L u A L 0 (§f >N ) | /ddf ] = 0. Therefore, 
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By using expression ([8]) and after some algebraic steps we get that 
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From ([8]) and the fact that tTq 1 (b\ C ' ) - 4 - (0'\), it follows that the order of magnitude of 
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the first term in the curly brackets of (l36l) is (),, 
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in the curly brackets of (l36l) is O p (1); therefore (l36l) does not converge to zero in probability. 
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show that conditions (i) and (ii) of Theorem 2 are satisfied. In the previous section we proved 
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where T f 1 1 is the q\ x </\ matrix defined in the statement of Theorem 2 and S l22 is the q\ x q\ 
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vPm i-^o’ra 


00 


(1) 


00 


( 1 ) 
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From (l34l) and (l37l) it follows that a\ c ^ a\ v , and hence 1/2 (V l ‘ / i_r ij and r, l/2 (V^ — 
Ti j do not have the same asymptotic normal distribution. In addition, (1351) and (l37l) imply that 

7 ^ S l22 , and consequently that t^ 2 (df ! — 0 ^j and t^ 2 — 0 *^ do not have the 

same asymptotic normal distribution. Notice also that even though the asymptotic marginal 
distributions of the UMLEs and CMLEs of T\ and 0) are not the same, from (l37l) it follows 
that if n/N were small enough so that 1 —(1—n/iV)^^ (0*) ~ 1 — ttq 1 } (0^), then \I/i « Si 22 
and their asymptotic marginal distributions would be very similar to each other. 


4.6 Asymptotic properties of unconditional and conditional maximum 
likelihood estimators of ( 7 - 2 , 0* 2 ) 

The unconditional and conditional maximum likelihood estimators of (r 2 ,0 2 ) are exactly 
the same as those used in capture-recapture studies. Sanathanan (1972) assumed conditions 
similar to (l)-(4) and (6) and proved the following results: 

a(U) p * *(0 p 

(l) 0 2 —> 0 2 and 0 2 —> 0 2 as r 2 — > oo. 


(ii) /t 2 —> 1 and r 2 OJ /r 2 -4- 1 as r 2 —* oo. 


(iii) 


-1/2 (AU) 


To 


T 2 i T 2 




e : 


—^ Aqr 2 ( 0 , S 2 ) and 


1/2 / . (C) 


To 


^2 ,r 2 ' 


1/2 


0 


—y Nq 2 (0, S 2 ) as t 2 —> oo, 
where S 2 is the inverse of the (g 2 + 1 ) x (q 2 + 1 ) matrix S ^ 1 defined by 

[£,: 1 ] 11 =[l- I r< 2) (0D] 

[ S 2 1 ] 1 , J + 1 =[ S ^ 1 ] J+I , I = - [V’tf’ra] 


3(2) 


J+lji+1 


5] [lM 2) (02)] 0vrf(0;)/00 


,7 = 1, 

(2)1 


,92, 


3(2) 




h3 = !,•••, 92 , 


and which is assumed to be a non-singular matrix. 

Because the proofs of these results are exactly the same as those given by Sanathanan 
(1972), we will omit them. It is worth noting that unlike the CMLE [r x ~ 1 / 2 f^ ! ^ t\ 2 o [ ? j, 

the estimator (V 2 ~ 1 ' /2 f!f ) ,r 2 / 2 0 2 C) j does have an asymptotic multivariate normal distribu¬ 
tion. 
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The previous results imply that r 2 J <J ( f 2i - r 2 ) —> 7V(0, as ) and n 2 1/2 1 - r 2 ) —v 

tVin, 'j 2 j, where 


<t. 2 = 


4 2, («» + 


1 - 4 2) (0 


Vtrf (eg [e^ 


4 2, (eg i-4 2) (» 


i-tti (e 2 

Vtrf (95)1 fv4 2> (e 


-1 


Vtr< 2) (e* 


where Vttq 2> (0 2 ) is the gradient of ." ( x (e 2 ) evaluated at e 2 and E 2 : is the q 2 x y 2 submatrix 
of E^ 1 obtained by removing its first row and first column. 


4.7 Consistency and asymptotic normality of the unconditional and 
conditional maximum likelihood estimators of r = t\ + 72 

The UMLE and CMLE of r = T\ + T 2 were defined in Subsection 3.3.3 by = 
f/ 1 + and = t[ ( ] + t 2 C \ From assumptions A and B and the previous results 
we have that t^/t = (ti/t) + (t 2 /t) A cki x 1 + ck 2 x 1 = 1 , 

as ri —)■ 00 and r 2 —>■ 00. Similarly, t^/t A 1 as Ti —> 00 and r 2 —» 00. Furthermore, 
t- 1 / 2 (t ( c/ ) - r) =(r/ri) _ 1 / 2 r 1 _ 1 / 2 ^ff°- rij +(r/r 2 ) _ 1 / 2 r 2 ' 1/2 (f 2 (C/) - r 2 j -4 N( 0 ,af;), 

as T], —> 00 and r 2 —* 00, where afj = al v + a 2 cr|. Likewise, r ” 1 / 2 (f( G) — r) -4 
iV(0, cr^), where < 7 ^ = aicr 2 ^ + a 2 crf. 

5 Estimation of the matrices 1 and 

Although estimates of E^ 1 , k — 1,2, and T*, 1 can be obtained by replacing the parameters 
and 0] by their respective estimates in the expressions for these matrices, this procedure 
requires the computation of sums of 2 n terms. This is not a problem if n is small, but if n is 
large enough, say greater than or equal to 20, the number of these terms is very large and the 
calculation of the estimates of E^T 1 and T'f 1 could be computationally expensive. 

A procedure that requires a much smaller number of calculations is based on estimates 
of the vectors v{ k) , t = 1,..., r k , k = 1, 2. Vectors V) 1 ! s were defined in the proofs of 
Theorems 1 and 2, whereas vectors V, (2) s are defined in Sanathanan (1972) and we will give 
their definition later in this section. As was shown in the proofs of Theorem 1 and 2, the 
vectors Vj 1 J s are independent and equally distributed with mean vector equal to the vector 
zero and covariance matrix equal to E, 1 in the case of Theorem 1, and T*, 1 in the case of 
Theorem 2. The same result holds in the case of the vectors V) s, but the covariance matrix 
is E 2 1 . Therefore, the sample covariance matrix of the vectors Vj /, J s is an estimate of their 
covariance matrix ( E^ 1 or fE^ 1 ) based only on r k observations. 
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To implement this procedure we need to estimate the V^' -'s (they are unknown because 
depend on 6 * k and 77 .). In the case of the V^s defined in Theorem 1 an estimate V{ lj of 
vj 1 ' could be obtained by replacing 0 \ by 0 \ > in the expression for vj 1 *, and T\ could be 
estimated by t[ U \ In the case of the V^s defined in Theorem 2 estimates of Vj IJ could be 
obtained by replacing 0 \ by 6 l in the expression for V^, and t\ could be estimated by 
t\ ’ . Estimates of V) s and t 2 could be obtained as in the case of Theorem 1, and in this 
situation both UMLE and CMLE could be used. Thus, once T k and the vectors V,; s are 
obtained, their sample covariance matrix can be computed and used as an estimate of or 




-1 


The vectors v| 2) = [v} 2 \ ..., V^ +1 ]', t — 1,..., r 2 , are defined as follows: 


(a) = 1 and V t [f +1 = [7rx 2) (02)] 1 d'n { ^\6* 2 ) / dOf ] , j = l,...,q 2 , if the vector xj; 2) of 

link-indicator variables associated with the t -th element in U 2 equals the vector x e 

O - {0}; 


(b) 


T/( 2 ) _ 

v t,l — 


4 2) (»»1 / 4 2) («S and v®, = \^\ev)}-^\ev)/def. 


J = 


1 ,_, q 2 , if the vector x{ 2> of link-indicator variables associated with the f-th element 

in U 2 equals the vector 0 e Q. 


6 Conclusions 

Whenever we want to apply the results that we have obtained in this research to an actual 
situation we need to determine whether or not the assumed conditions are reasonably well 
satisfied by those observed in the actual scenario. In particular we have assumed that the 
numbers M,s of people found in the sampled sites follow a multinomial distribution with 
homogeneous cell probabilities and that the M t s go to infinity while the number of sites n in 
the sample and N in the frame are fixed. These assumptions imply that in the actual scenario 
the Mi s should be relatively large and not very variable. However, we do not know how large 
they should be so that the results can be safely used. Therefore, Monte Carlo studies are 
required to assess the reliability of the asymptotic results under different scenarios with finite 
samples and populations. In addition, although we have assumed a general parametric model 
for the link-probabilities which allows the possibility that the parameter depends or not on 
the sampled sites, the model precludes that the probabilities depend on the M,s as they go to 
infinity. Furthermore, this assumption assures that the estimators of T\ and r 2 be independent 
and not only conditionally independent given the M,s. 

An alternative asymptotic framework to the one considered in this work is to assume that 
the numbers of sites n in the sample and N in the frame go to infinity whereas the M r s are 
fixed. However, this would involve dealing with multinomial distributions with infinite num¬ 
bers of cells. An approach that could be used to derive asymptotic properties of estimators 
under this framework is the one considered by Rao (1958) who derived asymptotic properties 
of a maximum likelihood estimator of a parameter on which depend the cell probabilities of 
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a multinomial distribution with infinite number of cells. However, this is a topic of a future 
research. 
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